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Abstract. We analyse the asymptotic behaviour of the probability of observing the expected number of suc- 
cesses at each stage of a sequence of nested Bernoulli trials. Our motivation is the attempt to give a genuinely 
frequentist interpretation to the notion of probability based on finite sample sizes. The main result is that the 
probabilities under consideration decay asymptotically as n where n is the common length of the Bernoulli 
trials. The main ingredient in the proof is a new fixed-point theorem for non-contractive symmetric functions 
of the unit interval. 



1 . Introduction and main results 

In a frequentist interpretation, the probability of an event is defined as its asymptotic relative frequency 
in a large number of independent experiments. In modern axiomatic probability theory, this interpretation 
is reflected in various forms of the law of large numbers. We refer the reader to any standard text book 
of probability theory for a technical discussion of these topics and to von Mises (1981) for a more philo- 
sophical account. Frequentism suggests that a sequence of n\ independent experiments with individual 
probability of success p (such as the tossing of a biased coin), would yield, on average, nip successes. This 
is reflected by the fact that the number of successes in this setup follows a Bin(n; , p) binomial distribution 
which assigns probability Pm^n, - (m,)^™'^ _ p) ni ~ m > to the event of observing m\ successes, and has 
expected value n\p. In a genuinely frequentist approach, these probabilities should be interpreted, again, 
as limits of relative frequencies. More precisely, if the sequence of n\ independent experiments were to be 
repeated, independently, «2 times, then, on average, one would observe «2Pm,,«i mns with m \ successes. 
In fact, for each m\ - 1, . . . ,m, the number of runs with exactly m\ successes follows a Bin(n2,/?m 1 ) ,« 1 ) 
binomial distribution, which is defined by the probabilities 
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of observing ni2 runs with m\ successes. Iteration of this process leads to the recursive definition 
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In the following we restrict our attention to the special case where the numbers are all equal to some 
n and the numbers are equal to the expected number of successes at stage k, i. e. ni\ = np, m-i - np„p. n , 
and so on. The numbers m k will not, in general, be integers, unless p is rational and n is sufficiently large. 
This could be remedied by considering the integer closest to instead, but we will not do that here. The 
subject of the paper is an asymptotic analysis of the array of numbers p^ defined recursively by /?o,« = P 
and 

Pk,n = l H )(p k -l,n) nPk - 1 *(l-p k -l,n) n(1 - Pk - 1 *\ k>l. (1) 

Surprisingly, these numbers do not seem to have received any attention in the past. They arise very nat- 
urally, however. They are the probabilities that, in a nested series of Bernoulli experiments, the number 
of successes coincides at each stage with the expected number of successes. Classically, the binomial 
coefficient rj is defined for positive integers m < n by the formula n\/m\(n - m)\. We extend this defin- 
ition to the case of real numbers by replacing the factorials in the denominator by Gamma functions, i. e. 
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= n\/T{a + 1)F(« - a + 1), < a < n. For two sequences a„, b„ of positive real numbers we write 
a„ ~ b„ if lim^^oo a n /b n = 1. 

With the following result we initiate the study of the probabilities p^„. 

Theorem 1.1. For every positive integer k and every p e (0, 1), there exist and /3k such that pk. n ~ 
ak{2nnY^ k as n — > oo. The numbers at are given by — [p(l — p)Y~ 1 ^ 2 ^. The rates /3k do not depend on 
the initial value po t „ — p and are given by /3k — [1 - (— 1 /2) k ] /3; in particular, ak and /3k converge to 1 and 
1/3, respectively. 

Interestingly, the rates /3k are related to the well-known Jacobsthal numbers Jk (OEIS A001045) via 
2 k /3k = Jk- The trivial cases p G {0, 1} are easily dealt with separately and are seen to lead to pk, n = 1. In the 
next result we look at the array (pk. n ) from a different angle and consider the case where k tends to infinity 
while n is held constant. 

Theorem 1.2. For every positive integer n and every p e (0, 1), the probabilities pk,„ converge, as k — > oo, 
to a limit p n € (0, 1 ). This limit is independent of p and is characterised by being the unique solution of the 
fixed-point equation 

Vn4 H ) P 7"(1-Pn) na - P " ) . (2) 

Furthermore, p n ~ (InnY 1 ^, as n — > oo. 

2. Proofs 

In this section we present proofs of Theorem 1.1 and Theorem 1.2. We will repeatedly use Stirling's 
approximation for factorials (or the Gamma function). 

Lemma 2.1 (Stirling's approximation). For every positive integer n, the factorial n\ satisfies 

V2W 7+I/ V" < n\ < en" +1/ V", and n\ ~ VW ,+1/ V". (3) 

An analogous approximation holds for the Gamma function T. In particular, for every positive integer n 
and every positive rational number a not exceeding n, the binomial coefficient rM satisfies 

V2W ,+I/2 e-" ln\ en" +1/ V" 
< < (4) 
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Proof. Immediate consequences of Robbins (1955). □ 

We now give the proof of our first main result. 

Proof of Theorem 1.1. Induction. We first consider the base case. Here, Stirling's approximation (Lemma 2.1) 
applied to Eq. (1) with k — 1 shows that 

1 

P\,n — , 
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and thus p\ M ~ a\(2nny^ 1 with ot\ — 1/ yjp(l — p) and (3\ — 1/2. Now, assuming that pk-\. n ~ 
<Xk-\(2nnyP k - x , we prove the corresponding statement for p^„, k > 1. The same approximation as before, 
applied to Eq. (1), yields 

p k , ~ 1 — ~ -^(2nnT^I\ 

■^2npk-i,n(i - Pk-i,n) yn ya k -\ 

which proves the first part of the theorem. It also shows that the numbers ak and /3k satisfy the recursions 

a k = ^^, A=(l~A-i)/2, 
which are easily solved by the reader's favourite method. □ 

In the next lemma, we will investigate the function P n : p i-> („'^)p" / '(l - p)"^^ p \ which governs the 
recursion (1). 

Lemma 2.2. For every positive integer n, the function P n is convex. 
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Proof. We will prove the stronger claim that P„ is log-convex. An easy calculation yields 

log P n (p) = — - n + np) + <A (1) (1 + «(1 - p))\ , 

n ap z p(l - p) 1 J 

where i/r (v) denotes the polygamma function of order v. See, for instance, Abramowitz and Stegun (1992, 
Section 6.4.) for an introduction to, and basic properties of, these functions. To prove log-convexity of P„, 
we thus need to argue that for all p e [0, 1], l/[p(l —p)] is greater than n [^^(1 + np) + if/ w (l + n(l - p))]. 
It is easily seen that both expressions are symmetric around p — 1/2 and possess power series expansions 
around that point. More precisely, one finds that 
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n f«A (1) (l + np) + + «(1 - p))\ =2 J] n ^\nl2 + 1) (p - l/2) 2i . 



i=0 

We can thus prove the claim by arguing that, for every positive integer n and every non-negative integer i, 
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It follows from Chen (2005, Theorem 1) (applied with m = 0) that 



^ 2i+1 »(«/2 + 1) < i2/,i 



(n/2 + 1) 2 ' +1 
and it thus suffices to prove that 
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This is obvious for i - 0. For positive i, it follows from the observation that the left-hand side is no less 
than the sum of the first three terms of its binomial expansion, and that this sum exceeds the right-hand 
side. □ 

In the next lemma we establish conditions for a function / : [0, 1] — > [0, 1] to have a unique fixed 
point, and for the fixed-point iteration to converge to this fixed-point from any starting value. Notably, we 
do not assume that / is a contraction. This fixed-point theorem will be the key ingredient in our proof of 
Theorem 1.2. 

Lemma 2.3. Let f : [0, 1] — > [0, 1] be a differentiable convex function such that /(0) = /(l) = 1 and 
lim^i f'(x) > 1. Then f has exactly one fixed point x* in (0, 1). If, moreover, \ f'(x*)\ < 1, / is symmetric 
around 1/2 and c := /(1/2) is such that either c < 1/2 and /(c) < 1/2, or c > 1/2, then, for every 
x € (0, 1), the sequence of function iterates (x, f(x), f(f(x)), . . .) converges to x*. 

Proof. Since f{\) = 1 and lim^_»i fix) > 1, there exists y e (0, 1), such that f(y) < y. If we define 
g : x h-> f{x) - x on [0,1], then g(Q) = 1 > and g(y) < 0, and thus, by the intermediate value theorem, 
there exists x* e (0, y), such that g(x*) = 0, i. e. x* is a fixed point /. Uniqueness follows from the 
observation that g(l) = and the fact that the convex function g can have at most two zeros. 

For the second part of the lemma we observe that the condition \f'(x*)\ < 1 implies via Banach's 
contraction principle (Granas and Dugundji, 2003, Theorem 1.1) that there exists a neighbourhood / of x* 
such that the iteration of / on / converges to x*. We can thus define J = (a,b) as the maximal interval 
containing / with the property that the iteration of / on J converges to x*. We will show that a — 0, b = 1. 
It is clear from the continuity of / that / is open. By definition, the image of J under / is contained in 
J, and since J is maximal the images of the boundary points a, b are not elements of J. This implies that 
f(a), f(b) € {a,b}. Since f(a) = a would imply the contradiction that a e {x*, 1], we have f(a) = b. We 
therefore need to rule out the possibility f(b) — a so that it will follow that f(b) = b and thus b = 1 and 
a — 0. We first consider the case c = /(1/2) < 1/2, which implies x* < 1/2 and 1/2 e J. To see this, 
we observe that the sequence f 2 '(l/2) of even function iterates is a decreasing sequence and bounded from 
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Figure 1. Illustration of the function P„ governing the recursion ( 1) and the fixed-point 
iteration considered in the proof of Lemma 2.3. In the picture, n — 10 and po, n = 0.15. 



below, and therefore converges to, say, y, which is a fixed point of / o /. The condition f(c) < 1/2 implies 
that f a f has only one fixed point in (0, 1) and since x* is a fixed point of / o /, it follows that y = p 
and 1/2 e J. Hence, by the symmetry of /, the maximal interval J - (a,b) is symmetric around 1/2 and 
one has f(a) = f(b). This implies that f(b) cannot equal a because otherwise b = f(a) = f(b) = a. The 
argument is similar for the case c > 1/2 except that no assumption on f(c) is necessary to conclude that 
1/2 e J. a 



An illustration of the function P„ and the fixed-point iteration considered in Lemma 2.3 is provided in 
Fig. 1 . We now proceed to the proof of our second main result. 



Proof of Theorem 1.2. We first observe that the functions P„ satisfy 



limP„0>) = lim />„(/>) = 1, 

p->0 P->1 



lim P' n (p) - -oo, 
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and apply Lemmas 2.2 and 2.3 to conclude that they possess a unique fixed point p n e (0, 1). In order to 
establish the convergence p^ — > p n , we need to verify the assumptions of the second part of Lemma 2.3. 

We first need to analyse in more detail the point (c„, P n (c n )), where c„ = P n {\/2) = 2~"["^. Forn = 1,2 
one obtains c\ — 2/n > 1/2 and = 1/2, respectively, and the conclusion follows. For n > 2, the numbers 
c n are less than 1/2 and we thus need to show that P„{c„) < 1/2. This is easily checked numerically for 
n — 3. For n > 3, it follows from an application of the estimate 
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to the central binomial coefficients followed by an application of Stirling's bounds (Lemma 2. 1 ) to ( ^j^j 
that 



Pn(Cn) = 
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To get from the first to the second line, we used that both ( n "J and its bound 2"/ \Jnnj2 are less than 2" 1 

and that x h-> ( 2 -"„J * s mcreasm g f° r * < 2" _1 . Showing that the last line in the previous display is less 
than 1 /2 for all n greater than three is a matter of basic, yet tedious, calculations. We next prove that 
\P' n (Pn)\ < 1- Differentiation of P„ and simplification of the resulting expression using p„ - P n (p„) shows 
that 

P'M = np„ (<A (0, (1 + n(l - Pn )) - <A (0) (1 + np n ) - log[(l - p)/p]) . (5) 
We will only consider the case n > 2. For n = 1,2 the claim can be checked numerically or dealt with by 
a straightforward adaptation of the arguments we are about to present. The proof so far has shown that, for 
n > 2, the fixed point p„ is less that 1 /2, and that P' n {p„) is thus negative. To show that the right-hand side 
of Eq. (5) exceeds -1, we use the bounds (Chen, 2005, Theorem 1) 



1 



1 



log x — — -r < i/f (0) (x) < log x - — 
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which are valid for positive x. We thus obtain 

1 +n 



< log(l + x) < X, 



P'niPn) > 



-1 + 1- 
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-1 + r n (p n ). 



2 1+ «(1 - p„) 6(1+ np„) 2 3 1+ np n 
It is an easy exercise to show that r„(p) is greater than zero for all p e (0, 1). 

The final statement of the theorem about the decay rate of p n as n tends to infinity follows again from 
applying Eq. (3) to Eq. (2), which yields 

1 

Pn 



yj2np n {\ - p„) yfn 



and thus p„ ~ (2m) 



■1/3 
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